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Abstract 

Lattices possess elegant mathematical properties which have been previously used in the literature to show 
that structured codes can be efficient in a variety of communication scenarios, including coding for the additive 
white Gaussian noise (AWGN) channel, dirty-paper channel, Wyner-Ziv coding, coding for relay networks and so 
forth. We consider the family of single-transmitter multiple-receiver Gaussian channels where the source transmits 
a set of common messages to all the receivers (multicast scenario), and each receiver has coded side information, 
i.e., prior information in the form of linear combinations of the messages. This channel model is motivated by 
applications to multi-terminal networks where the nodes may have access to coded versions of the messages from 
previous signal hops or through orthogonal channels. The capacity of this channel is known and follows from 
the work of Tuncel (2006), which is based on random coding arguments. In this paper, following the approach 
introduced by Erez and Zamir, we show that lattice codes are capacity-optimal for this family of channels. The 
structured coding scheme proposed in this paper utilizes Construction A lattices designed over prime finite fields, 
and algebraic binning at the decoders to expurgate the channel code and obtain good lattice subcodes, for every 
possible set of linear combinations available as side information. As a corollary, we show that lattice codes based 
on Construction A can achieve any rate less than the capacity of the single-user AWGN channel for any sufficiently 
large prime field Fp, which is an improvement over the results available in the literature which require the prime 
p to grow as a function of the code length n. 


Index Terms 

Additive white Gaussian noise (AWGN), capacity. Construction A, Gaussian broadcast channel, lattice, mul¬ 
ticast, side information, structured codes. 


1. Introduction 

Information-theoretic results often rely on random coding arguments to prove the existence of good 
codes. Usually, the codebook is constructed by randomly choosing the components of each codeword 
independently and identically from a judiciously chosen probability distribution. While this technique 
is powerful, the resulting codebooks do not exhibit any structure that may be of practical interest. One 
such desirable structure is linearity, which allows complexity reductions at the encoder and decoder by 
utilizing efficient algebraic processing techniques. Further, in certain communication scenarios, coding 
schemes based on linear codes yield a larger achievable rate region than random code ensembles, as was 
shown by Komer and Marton [|T| for a distributed source coding problem. Structured coding schemes have 
been widely studied in the literature, especially for communications in the presence of side information 
and in multi-terminal networks. For an overview of structured coding schemes we refer the reader to Q, 
0 and the references therein. 

For communication in the wireless domain, structured codes can be obtained by choosing finite subsets 
of points from lattices Q, [Q-@. A lattice is an infinite discrete set of points in the Euclidean space that 
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are regularly arranged and are elosed under addition. Codes based on lattiees, known as (nested) lattice 
codes or Voronoi codes, are the analogues of linear eodes in wireless eommunieations. Effieient lattiee 
based strategies are known for a variety of oommunieation seenarios, sueh as for aehieving the eapaeity 
of the point-to-point additive white Gaussian noise (AWGN) ehannel [|7|-[[^, for dirty-paper eoding Q, 
[121, the Wyner-Ziv problem Q and eommunieation in relay networks [|13||-[ 16|, to name only a few. 

In this paper we present eapaeity-aehieving lattiee strategies for eommunieation in eommon message 
Gaussian broadeast ehannels, whieh we refer to as the multicast channel, where reeeivers have prior 
side information about the messages being transmitted. In partieular, we assume that the transmitter is 
multieasting K message symbols wi,..., wk from a finite field Fp, of prime size p, to all the reeeivers, and 
eaeh reeeiver may have coded side information about the messages: the prior knowledge of the values of 
(possibly multiple) Fp-linear eombinations of wi,..., wk- The number of linear eombinations available as 
side information and the eoeffieients of these linear eombinations ean differ from one reeeiver to the next. 
The eapaeity of this ehannel is known and follows from the results of Tuneel 0 . where the aehievability 
part utilizes an ensemble of eodebooks generated using the Gaussian distribution. 

The multiuser ehannel eonsidered in this paper is a noisy version of a simple speeial ease of index 


coding [18|-[20|. The index eoding problem eonsiders a noiseless broadeast link where eaeh reeeiver 


demands a subset of the souree messages and knows the values of some other subset as side information. 
A generalization of the index eoding problem in whieh the reeeivers have aeeess to linear eombinations 


of messages was studied reeently in [211, [22|. The speeifie instanee of index eoding where eaeh reeeiver 
demands all the messages from the souree eorresponds to a noiseless multieast ehannel and has a simple 


optimum solution based on maximum distanee separable (MDS) eodes [23|. When the ehannel is noisy, 
eapaeity-aehieving eoding sehemes based on struetured eodes are not available. In this paper we design 
optimum lattiee-based strategies for multieasting over the AWGN ehannel where the side information at 
the reeeivers is in the form of linear eombinations of souree messages. 

The ease of Gaussian multieast ehannel with eoded side information is motivated by applieations to 
multi-terminal eommunieation networks. It is known that signal interferenee in wireless ehannels ean be 
harnessed by deeoding linear eombinations of transmit messages instead of either treating interferenee 


as noise or deeoding interferenee along with the intended message [14[, [15|. When sueh a teehnique is 
used in a mutli-hop eommunieation protoeol, one eneounters reeeivers that have eoded side information 
obtained from transmissions in the previous phases. Similarly, in a network that eonsists of both wired 
and wireless ehannels, the symbols reeeived from wired links ean be utilized as side information for 
deeoding the wireless signals. If a linear network eode is used in the wired part of the network, then the 
side information is in the form of linear eombinations of the souree messages. 

Example 1 (Communciation in relay networks). Consider a wireless network with two base stations BSi 
and BS 2 , that hold message symbols Wi and W 2 , respeetively. The base stations are required to multieast 
wi and W 2 to four user nodes Ui,..., U 4 through the relay node R, see Fig. In the first phase of 
the protoeol, BSi and BS 2 eneode the data symbols wi and W 2 , and transmit the resulting eodewords 
simultaneously. By using the deeoding teehnique of eompute-and-forward [15[, U 3 reliably deeodes some 


linear eombination siWi -f S 2 W 2 , Si,S 2 G Fp, from the reeeived noisy superposition of the two transmit 
signals. On the other hand, R has a higher signal-to-noise ratio and sueeessfully deeodes both wi and W 2 
by behaving as a multiple-aeeess reeeiver. Further, there is no signal interferenee at Ui and U 2 , and these 
two nodes reliably deeode wi and W 2 , respeetively. 

We observe that the seeond phase of the protoeol is a eommon message broadeast ehannel with eoded 
side information at the reeeivers: the relay needs to multieast Wi,W 2 to four user nodes, the first three 
users Ui, U 2 , U 3 have prior knowledge of the linear eombinations Wi + 0 w 2 , Owi -f W 2 and SiWi -f S 2 W 2 , 
respeetively, while the fourth user has no sueh side information. ■ 


Example 2 (Wireless overlay for wired networks). Assume a network of noiseless wired links in the 
form of a direeted aeyelie graph, where the souree node Vs desires to multieast K independent messages 
Wi,... ,wk G Fp to a set of destination nodes V. The wireline network employs a traditional (scalar) 
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(a) Multiple-access phase: The relay R decodes both u>i 
and W 2 , while Ui, U 2 , U 3 decode wi,W 2 and siwi +S 2 W 2 , 
respectively. 


Fig. 1. A relay network where one encounters a common 


SiWi + S2W2 



(b) Multicast phase: R multicasts wi and W 2 to all four 
user nodes. Three of the users have the knowledge of some 
linear combination of wi and W2, while the fourth user has 
no side information. 

broadcast channel with coded side information at the receivers. 


linear network code [24|-[26|, i.e., the symbol transmitted on each outgoing edge of a node is an element 
of Fp generated as a linear combination of the symbols received on its incoming edges. At every destination 
node Wd £ the decoder attempts to recover the K message symbols from their Fp-linear combinations 
received on its incoming edges. Recovery is possible if and only if the number of linearly independent 
equations available at is K. It is known that the maximum number of linearly-independent equations 
that can be made available at is min{max-flow(f;d), K}, where max-flow(fd) is the maximum number 
of edge-disjoint paths from Vs to Vd, see p 6 | . It follows that multicasting is possible if and only if 
max-flow (fid) > K for every Vd G V. 

Now suppose there exist destination nodes with max-flow less than K, i.e., the communication demands 
are beyond the wireline network’s capacity. A solution to meet the demands is to broadcast a wireless 
signal from the source to fill the capacity deficiency of the wired network, see Fig. At each destination, 
the Fp-linear combinations obtained from the wireline network serve as side information to decode the 
wireless broadcast signal. ■ 

A special case of coded side information is the Gaussian multicast channel where each receiver has prior 
knowledge of the values of some subset of the K messages. The known capacity-achieving coding schemes 
for this special case are based on random coding using i.i.d. (independent and identically distributed) 
codewords [|T7|, Existence of lattice based capacity-achieving coding schemes were proved 

in p 8 |, [|3^ for the special case where the number of messages and receivers are two and each receiver 


has the knowledge of one of the messages. Constructions of binary codes for this channel were proposed 
in 


[-p5||. Explicit constructions of lattice codes were given in p 6 | |, p7| that convert receiver side 
information into additional apparent coding gain in the AWGN channel. Codes based on quadrature 

1 , explicit codes based on lattices and coded 


amplitude modulation were constructed in p 8 | , p9| . In 
modulation have been designed that perform within a few decibels of capacity when the number of 
receivers is two and each knows one of the two messages being transmitted. 

The objective of this paper is to prove that lattice codes achieve the capacity of the common message 
Gaussian broadcast channels with coded side information. We use the information-theoretic framework 
set by Erez and Zamir [[^ to this end. The proposed coding scheme uses an ensemble of Construction A 
lattices and the decoding scheme involves algebraic binning 0 where the receiver side information is 
used to expurgate the channel code and obtain a lower rate subcode. The set of linear equations available as 
side information may differ from one receiver to another, and hence, each receiver must employ a different 
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Fig. 2. Wireless multicast as an overlay for a wired network: the source node Vs encodes wi,, wk and broadcasts wirelessly to the 
destination nodes ,..., 114 in order to supplement the communication through the wired network. Each destination receives (possibly 
multiple) Fp-linear combinations of the source symbols through a linear (wireline) network code and uses this information to decode the 
wireless signal broadcast by Us. 


binning scheme for the same channel code. The coding scheme ensures that the binning performed at each 
receiver produces a good lattice subcode of the transmitted code. Following expurgation, each receiver 
decodes the channel output by minimum mean square error (MMSE) scaling and quantization to an infinite 
lattice. The algebraic structure of the coding scheme facilitates the performance analysis by decomposing 
the original channel into multiple independent point-to-point AWGN channels - one corresponding to 
each receiver - where each of the point-to-point AWGN channels uses a lattice code for communication. 
Unlike [[8|, where achievability in a point-to-point AWGN channel was proved using error exponent 
analysis, we provide a direct proof based only on simple counting arguments. 

As a corollary to the main result, we obtain an alternative proof of the goodness of lattice codes 
in achieving the capacity of the point-to-point AWGN channel. Previous proofs of this result presented 
in Q-[[T^ also use ensembles of lattices obtained by applying Construction A to random linear codes 
over a prime field F^; see also [40|, |411. While [[^ used primes p that were exponential in the code length 


n, [j^ and |10| improved this result to let p grow as and n^-^, respectively. The corollary presented 
in this paper further improves these results by enabling a choice of the prime p which is independent of 
the code length n but is a function only of the gap between the desired rate and the channel capacity. 

Lattices have been used to design powerful physical-layer coding schemes for wireless networks 
consisting of multiple sources, relays and destinations pA||-p^. In these networks information from 
the source nodes is conveyed to the destination nodes through relays over multiple hops and time slots. 
In each time slot, a set of nodes act as transmitters and every other node in their range observes a 
linear superposition of the transmitted signals perturbed by AWGN. Lattice coding schemes for these 
networks are designed such that each receiver can reliably decode the observed noisy superposition to a 
linear combination of source messages which it then proceeds to transmit in the next time slot. Every 
destination node decodes its desired messages once it collects sufficiently many linear combinations. In 
contrast, in this paper we consider a single hop interference-free transmission in a multicast channel 
consisting of one transmitter and multiple destination nodes that are aided by coded side information. 
Our objective is to design coding schemes that can utilize prior knowledge at these receivers rather than 
exploit wireless interference arising from multiple simultaneous transmissions, as often experienced in 
relay networks. 

The organization of this paper is as follows. We introduce the channel model in Section [Tl-AI and review 
the relevant background on lattices and lattice codes in Section |II-B[ In Section [nil we state the main 
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theorem, and deseribe the lattiee eode ensemble and eneoding and deeoding proeedures. We prove the 
main theorem and state a few eorollaries in Seetion and finally, we diseuss some eoneluding remarks 
in Seetion |Vl 

Notation: Matriees and eolumn veetors are denoted by bold upper and lower ease letters, respeetively. 
The symbol || ■ || denotes the Euelidean norm of a veetor, and (■)t is the transpose of a matrix or a veetor. 
The Kroneeker produet of two matriees A and B h A® B, li h the £ x £ identity matrix, and 0 is the 
all zero matrix of appropriate dimension. The symbol log(-) denotes logarithm to the base 2 and In(-) 
denotes logarithm to the base e. The expeetation operator is denoted by E. The symbol M\N' denotes 
the elements in the set M. that do not belong to the set M. 

II. Channel Model and Lattice Preliminaries 
A. Channel Model and Problem Statement 

We eonsider a (non-fading) eommon message Gaussian broadeast ehannel with a single transmitter and 
finitely many reeeivers, where all terminals are equipped with single antennas. The transmitter operates 
under an average power eonstraint and the reeeivers are affeeted by additive white Gaussian noise with 
possibly different noise powers. There are K independent messages Wi,... ,wk at the transmitter that 
assume values with a uniform probability distribution from a prime finite field Fp. Eaeh reeeiver desires 
to deeode all the K messages while having prior knowledge of the values of some Fp-linear eombinations 
of the messages wi,..., wk- Consider a generie reeeiver that has aeeess to the values Um, m = 1,..., M, 
of the following set of M linear equations 

K 

^ ^ ^m,ktOk tlmi m 1, . . . , M. 

k=l 

We will denote this side information eonfiguration using the matrix S = [sm,k] G F^^^, where eaeh row 
of S represents one linear equation. Any row of S that is linearly dependent on the other rows represents 
redundant information and ean be disearded with no loss to the reeeiver side information, and henee, with 
no loss to system performanee. Henee, without loss in generality, we will assume that the rows of S are 
linearly independent over Fp, i.e., rank(S') = M, and M < K. Note that the values of S and M ean be 
different aeross the reeeivers. A reeeiver with no side information is represented with an empty matrix 
for S (with M = 0 ). 

A reeeiver in the multieast ehannel is eompletely eharaeterized by its (coded) side information matrix 
S and the varianee of the additive noise. If we assume that the average transmit power at the souree 
is 1, then the signal-to-noise ratio at this reeeiver is SNR = ^/a^. We will denote a reeeiver by the pair 
(S', (j^), where S is any matrix over Fp with K eolumns and linearly independent rows, and > 0. Note 
that uncoded side information, i.e., the prior knowledge of the values of a size M subset of wi,..., wk, 
is a speeial ease, and henee, is eontained within the definition of our ehannel model. 

Example 3 . Consider a souree transmitting K = 3 symbols, wi, 102,103, from the finite field F 5 = 
{0,1, 2, 3,4}. A reeeiver that has prior knowledge of the value of 102 has side information matrix 

S = (0 1 0) . 

This eorresponds to the equation Owi + lw 2 + Ows, and the number of linearly independent equations at 
this reeeiver is M = rank(S) = 1. 

Now eonsider another reeeiver that has the knowledge of the values of the following three equations: 
wi + Aw 2 + 3w 3, Awi + 3 t <;2 and 2i<;i +W 2 + Sws. In matrix form, this side information is represented by 


14 3 
4 3 0 
2 13 
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where the three rows represent the three equations, in that order. The first row of this matrix is equal to 
the sum (over F 5 ) of the second and third rows, and hence, the side information from the first equation 
is redundant and can be discarded. Since the remaining two rows are linearly independent, the side 
information at this receiver can be represented by the following matrix that consists of these two rows. 

The number of linearly independent equations at this receiver is M = rank(S) = 2. ■ 


From elementary linear algebra we know that if the values Um of M linearly independent combinations 
of the variables wi,, wk are given, then the set of all possible solutions of (wi,..., wk) is a coset of 
a {K — M) dimensional linear subspace of F^. Since the apriori probability distribution of tui,... ,wk 
is uniform, we conclude that, given the side information values Um, rn = 1 ,..., M, the probability 
distribution of (wi,..., wk) is uniform over this coset. Using the fact that the number of elements in the 
coset is p^~^, we observe that the conditional entropy of (wi,... ,wk) given the side information is 

H{wi,...,wk\ui,...,um) = \og{p^~^) = {K - M)\ogp. (1) 


Suppose we want to transmit, on the average, one realization of {wi,... ,wk) in every k, uses of the 
broadcast channel. The transmission rate of each message is i? = ^/nlogp b/dim (bits per real dimension 
or bits per real channel use). 

For the simplicity of exposition, we consider only the symmetric case where all the K messages 
are required to be transmitted at the same rate R. The general scenario, where the messages are of 
different rates, can be reduced to the symmetric case through rate-splitting: if there are K' messages with 
transmission rates ri,... ,rx', respectively, then by splitting each of these original sources into multiple 
virtual sources, one can generate a set of K sources {K > K') such that their rates i?i,..., Rk are as 
close to each other as required. 

We will assume that the encoding at the transmitter is performed on a block of ^ independent realizations 
of the K message symbols, i.e., the source jointly encodes K message vectors Wi,... ,Wk G F^. The 
transmitter uses an n-dimensional channel code A’ C together with a function 


p : Fj X ■ ■ ■ X Fj ^ A 

to jointly encode the K message vectors. The number of codewords in A is p^^, and we will assume 
that the codebook A satisfies the per-codeword power constraint 

llrrlP 

< 1 , for all xeX. ( 2 ) 

n 

The average number of channel uses to transmit each realization of (wi,..., wk) is k = '^ji. The resulting 
rate of transmission of each of the K messages is 

!^ = ilogp b/dim, 

n n 

The sum rate of all the messages is ^^/niogp b/dim. 

The side information at (S', over a block of ^ realizations of the K message symbols is of the form 
Yl!k=i Sm,kWk = Um, m = 1 ,..., M, where M = rank(5) and Um G F^. This side information allows the 
receiver to conclude that the transmitted codeword must belong to the following subcode of A, 


A(S,Mi, ... ,Um) 


p{wi,... ,Wk) 


Wi,...,WKe¥i 


K 

^ ^ Sm,kUlk — Um for Uh 
k=l 



(3) 


The optimal decoder at (S', decodes the channel output vector to the nearest codeword x of this 
subcode, and the error probability at this receiver is the probability that the estimated message tuple 
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X{S,ui,... ,um) 


Fig. 3. Each receiver {S,a^) of the multicast channel uses its own side information {S,Ui,... ,um) to expurgate the channel code X and 
obtain a subcode X{S,Ui,... ,Um). Note that the resulting subcodes can be different across the receivers. In order to achieve the capacity 
of the multicast channel, we require that each of these expurgated codes be good for channel coding over the point-to-point AWGN channel. 


p~^{x) is not equal to the transmit message (rui,... ,Wk)- In order to aehieve the optimal performanee 
at a given reeeiver (5, cr^), we thus require that the expurgated eode X{S,Ui,... ,Um) be a good ehannel 
eode for the point-to-point AWGN ehannel. In the multicast channel that consists of multiple receivers, 
the side information matrix S can vary from one receiver to the next, and hence, the expurgated codes 
can be different at each receiver, see Fig. Hence, a capacity-achieving channel code X is such that the 
resulting expurgated code at every receiver is a good channel code for the AWGN channel. 

Problem Statement 

Problem Setup: Consider a common message Gaussian broadcast channel with single transmitter and 
N receivers. The transmitter desires to multicast K independent messages from a prime field Fp subject 
to the unit power constraint @ on the transmit codeword. Each of the N receivers (51, af),..., (Sat, ct^) 
has coded side information corresponding to the side information matrix Si E i = 1,... , N, and 

experiences an additive white Gaussian noise of variance = N. Without loss of generality, we 
assume that each of the side information matrices Si has linearly independent rows, i.e., rank(5j) = Mj. 
Using the information-theoretic arguments of OI). which is based on the average performance of an 
ensemble of randomly generated codebooks, it can be shown that the (symmetric) capacity of this multicast 








channel is 


C 


min 


1 

(K - Mi) 




(4) 


The proof of this result is similar to the proof of Theorem 6 of whieh eonsiders a discrete memoryless 
eornmon message broadeast ehannel where the side information at each receiver is, in general, a random 
variable jointly distributed with the souree messages (tci,... ,wk)- A sketeh of the proof that C is the 
eapaeity for the Gaussian multieast ehannel with eoded side information at the reeeivers is given in 
Appendix 

Problem Statement: Let e, C > 0 be fixed positive real numbers and let i? < C — e. We seek to determine 
whether there exists a lattiee eode for the multieast channel with eoded side information at the reeeivers 
that transmits eaeh of the K messages with rate at least {R — e) sueh that the probability of deeoding 
error at eaeh of the N reeeivers is at the most 

In this paper we answer the above stated problem in the affirmative under the assumption that the prime 
field Fp is suffieiently large. In partieular, we prove the existenee of a lattice eode with the said properties 
when the prime p satisfies the inequality p > max (2"/"* — 1)“^2“^}. Unlike the eapaeity Q whieh 

holds for any value of p, our result on the optimality of lattice eodes requires that p vary with the toleranee 
e. The larger the gap to eapaeity e, the smaller is the size requirement on the prime field ¥p. 


B. Lattice Preliminaries 

We now briefly reeall the neeessary properties of lattiees and lattiee eodes, and establish our notation and 
terminology. The material presented in this seetion eonsists of standard ingredients used in the literature, 
and is mainly based on Q, Q, [j^, [ |4^ . 

1) Lattices and Lattice Codes: Throughout this manuseript we eonsider n-dimensional lattiees A with 
full-rank generator matrix. The elosest veetor lattiee quantizer eorresponding to A is denoted by the 
funetion Q\ : —)■ A, and the volume of its (fundamental) Voronoi region V(A) = Q^^(O) is denoted 

by Vol(A). For any A G A, A + V(A) is the set of all points in M" that are mapped to A under Qa, and 
it has the same volume Vol(A) as V(A). For any two distinet lattiee points Ai ^ A 2 , the sets Ai + V(A) 
and A 2 + V(A) are disjoint. The modulo-k operation, defined as [x] mod k = x — Qx{x), satisfies the 
following properties 

[x] mod A G V(A) for all x G M"', 

[xi + X 2 ] mod A = [ [xi] mod A + X 2 ] mod A for all Xi,X 2 G ] 

[x] mod A = 0 if and only if x G A. 

We will denote the n-dimensional ball of radius r with eenter s G MR as B{s,r) 


and 


(5) 

( 6 ) 


i.e. 


B{s, r) = {x G 


||x — s|| < r} , 


and the volume of a unit-radius ball in n dimensions by Vn- It follows that the volume of B{s,r) equals 
I4r”. The covering radius of the lattiee A is denoted by rcov(A) and the effective radius of A by refr(A). 
We reeall that refr(A) < rcov(A) and 


reff(A) = 


m 


Rogers [44| showed that for every dimension n there exists a lattiee A sueh that 


.(A) 


< ^cn (Inn) 2j 


(7) 


( 8 ) 


refr(A) 

where c is a eonstant. Note that the right hand side of the above inequality converges to 1 as n —>■ cx). A 
sequenee of lattiees of inereasing dimension n is said to be Rogers-good if 1- Rogers’ result @ 

shows that sueh a sequenee exists (see also p^). 
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Let Ac C A be a pair of nested lattices and d G V(Ac) be a fixed vector. A (nested) lattice code or a 
Voronoi code (A — d)/Ac is the set (A — d) mod Ac obtained by applying the mod Ac operation on the 
points of the lattice translate A — d. The code consists of all the points in A — d that lie within the Voronoi 
region of Ac, i.e., (A — d) fl V(Ac). The lattice Ac is called the coarse lattice or the shaping lattice, A 
is called the fine lattice or the coding lattice, and d is the dither vector. The cardinality of this code is 
|(A —d)/Ac| = |A/Ac| = '^°*Ac)/vo 1 (a), and every codeword point x G (A —<i)/Ac satisfies ||x|| < rcov(Ac). 
Note that A/Ac is a lattice code with zero dither. 

2) Lattice Codes from Linear Codes over a Finite Field: In this subsection we briefly describe the 
method proposed in [15| to construct a pair Ac C A of nested lattices, and recall its relevant properties. 
This construction uses a coarse lattice Ac and a linear code ^ to generate a fine lattice A such that 
|A/Ac| = |^|. 

Let gf) denote the natural map that embeds Fp = {0,l,...,p —1} into Z. When applied to vectors, gf) 
acts independently on each component of a vector. Let C F" be a linear code of rank L, 1 < L < n, 

{Gw\we¥^}, 


where G is the nx L generator matrix with full column rank, and w is the message encoded to The set 
g{^) +p1F obtained by tiling copies of gf^) at every vector of pTA is a lattice in ML and is known as the 
Construction A lattice of the linear code ^ Q. Note that the number of points in (y'(^) + plA contained 
in the Voronoi region of the lattice pTA is |^| = p^. We obtain A by scaling down the Construction A 
lattice by p~^ and transforming it by the generator matrix Be of Ac 


A = Bep-^ {g{^) + pIA) = Bcp-^g{^) + BeZ" = Bcp-^g{^) + Ac. 


Since ^ contains the all zero codeword, it follows that A D BcP~^g{0) + Ac = Ac. We observe that 
applying the transformation BcP~^ to the lattice pZ'^ (instead of the lattice gfifi") + pZ") generates Ac 
(instead of A). Hence, A/Ac has the same algebraic structure as that of (gflf) +pZ”)/pZ’^, which in turn, 
is equivalent to the linear code In particular, 

|A/Ac| = =p^. (9) 


The following lemma provides an explicit bijection between the message vectors w G F^ encoded by ^ 


and the points in the lattice code A/Ac 
below for completeness. 


This result, which is originally from [15 Lemma 5], is proved 


Lemma 1. The map w —)■ [BeP ^g (Gw)] mod Ac is a bijection between F^ and A/Ac. 

Proof: From we know that |A/Ac| = |Fp| = p^. Hence, it only remains to show that no two 
distinct messages Wa and are mapped to the same point in A/Ac. Assuming the contrary, we have 
[BcP~^g{GwA)] mod Ac = [BcP~^g{GwB)] mod Ac. Using @ and we obtain 

BcP~^g{GwA) - BcP~^g{GwB) G Ac. 


Multiplying both sides by pBf^, we obtain g(GwA) — giGwB) G pZ”. Reducing this result modulo-p, 
we have Gw a — Gwb = 0 over Fp. Since this implies G{wa — Wb) =0 over Fp while Wa — 7 ^ 0 and 

G has full column rank, we have arrived at a contradiction. ■ 

In order to prove capacity achievability, we will rely on random coding arguments to show the existence 
of a good choice of G. As in [[T5|, we will assume that G is a random matrix chosen with uniform 


probability distribution on F^^^. The following result is useful in upper bounding the decoding error 
probability over the ensemble of random codes. 


in Fp^'^. Then [SeP 
code p~^Ac/Ac. 


Lemma2([|15|, [|41|, [|45|). Letw G Fp\{0} be a given non-zero vector, and letG be uniformly distributed 


g{Gw)] mod Ac is uniformly distributed over (p ^Ac) fl V(Ac), i.e., over the lattice 
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III. Lattice Codes eor the Common Message Gaussian Broadcast Channel with Coded 

Side Information 

We will assume that the number of messages K and a design rate R are given, and show that there 
exist good lattice codes of sufficiently large dimension n that encode K messages over an appropriately 
chosen prime field Fp at rates close to R b/dim. In order to rigorously state the main result, we consider 
a fixed non-zero tolerance e > 0 that determines the gap to capacity. 


Theorem 1 (Main theorem). Let the number of messages K, design rate R and tolerance e > 0 be 
given. For every sufficiently large prime integer p, there exists a sequence of lattice codes of increasing 
dimension n that encode K message vectors over Fp such that the rate of transmission of each message 
is at least {R — e) b/dim and the probability of error at a receiver {S, a^) decays exponentially in n if 


^ log (1 -f ^ ) >{R + e){K- rank(S')). 


cr^ 


( 10 ) 


To prove Theorem we utili z e the lattice code ensemble introduced in |T^; see Section II-B2 of this 
paper. A Rogers’ good lattice is chosen as the coarse lattice Ac. The fine lattice A is obtained from the 
generator matrix Be of the coarse lattice Ac and a linear code ^ over a large enough prime field Fp using 
the construction described in Section ITI-B2I 

The multicast channel considered in Theorem reduces to the traditional single-user AWGN channel 
if the number of messages K = 1, and the multicast channel consists of one receiver with an empty side 
information matrix S, i.e., rank(S') = 0. Hence, Theorem provides an alternative proof of the existence 
of lattice codes that achieve the capacity of the single-user AWGN channel, and we have the following 
corollary. 


Corollary 1. Consider a single user AWGN channel where the input is subject to unit power constraint 
and the noise variance at the receiver is o^. Let e > 0 be any constant and let 


R < 




— e. 


For every sufficiently large prime integer p, there exists a sequence of lattice codes of increasing dimension 
n constructed from linear codes over Fp (as described in Section H-Bl) such that the rate of each lattice 
code is at least R — e and the probability of decoding error at the receiver decays exponentially in n. 


The relation of Corollary to existing results on the optimality of Construction A based lattice codes 


in single-user AWGN channel is described in detail in Section IV-D2 


In the rest of this section we describe the construction of random lattice codes, and the encoding and 
decoding operations used to prove Theorem We provide the proof of the Theorem in Section IV 


A. Random lattice code ensemble 

1) Prime p: Given the design rate R, number of messages K and tolerance e > 0, we choose p as any 
prime integer satisfying 

p > max{22-^^, (2'/" - l)-^2-^} . (11) 

The coding schemes of this paper are based on Construction A lattices which are obtained by lifting linear 
codes over Fp to the Euclidean space M". The generator matrices of these Fp-linear codes are constructed 
randomly, and the first constraint in ( [TT] ), viz. p > 2^^^, will allow us to show that these randomly 
constructed generator matrices are full-ranked with probability close to 1. 

The proof of Theorem given in Section |IV] involves the derivation of an upper bound on the probability 
of decoding error averaged over an ensemble of lattice codes derived from Construction A. We will use 
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the inequality p > (2'/'* — 1) ^2 ^ from ( [TT] ) to show that this upper bound is exponentially small in 
dimension n. Note that this inequality implies 


for any integer M satisfying Q < M < K — 1. Rearranging the terms in the above inequality we obtain 


1 

^2(R+e){K-M) 


+ 1 < 


( 12 ) 


2) Message length 1: Onee p is fixed, we ehoose I as the largest integer that satisfies 


P 

— \ogp < R. (13) 

n 

The left-hand side in the above inequality is the aetual rate at whieh the lattiee eode eneodes eaeh message, 
while R is the design rate. The differenee between the two is at the most 


n 


log p -log p 

n 


\ogp 

n 


whieh eonverges to 0 as n —>■ cx). It follows that the eode rate ^jn loggP tends to the design rate R as 
n —)■ oo, and henee, ^jn log 2 P > R — e for all suffieiently large n. 

3) Coarse Lattice A^.' From @ in Seetion [ri-Bl[ we know that for a given e > 0 and for all suffieiently 
large n, there exists an n-dimensional lattiee Ac sueh that 


^cov(Ac) 2^/4 
^eff (Ac) 

We will ehoose sueh a Rogers-good lattiee as Ac, and seale it so that 


?’cov(Ac) = ^/n. 

It follows that reff(Ac) > 2“'/'* rcov(Ac) = Using the definition of the effeetive radius Q, we 

arrive at the following lower bound on the volume of the Voronoi region of Ac 

Vol(Ac) = K (Ac) > K (14) 


4) Fine Lattice A.- The fine lattiee is obtained by the construetion of [15| deseribed in Seetion [II-B2 


The length of the linear eode ^ is n, and its rank L = KP is the number of message symbols to be 
eneoded by the lattiee eode. Note that this requires that KP < n he true. Using ( [T3] ) and the property 
p > we have 

= (15) 

logp 2KR 2 


whieh ensures that KP < n. If G G is the generator matrix of then A = BcP + Ac. 

We will ehoose G uniformly random over the set of all n x KP matriees of ¥p, resulting in a random 
ensemble of fine lattiees A. 

5) Dither vector d: We will rely on random eoding arguments to prove the existenee of a translate d 
sueh that the eode (A—d)/Ac performs elose to eapaeity. We will assume that d is distributed uniformly in 
V(Ac) and is ehosen independently of G. This random dither d is usually viewed as a eommon randomness 
available at the transmitter and the receivers Q. Note that ||</|| < rcov(Ac) = ^/n. 
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Wi 


Wk- 


OJ 

Pi 

< 1 ^ 

a 

o 

p 

o 

O 


w 


Gw 


BcP ^g{-) 

^(±>- 

[•] mod Ac 


X 


d 


Fig. 4. The encoding operation at the transmitter that maps the messages wi,... ,Wk to a point in (A — d)/Ac. 


B. Encoding 

We will now describe the encoding operation p at the transmitter that maps the message vectors 
(wi, ... ,Wk) G Fp X ■ ■ ■ X Fp to a codeword a; G (A — d)/Ac. The encoder first concatenates the K 
messages into the vector w = (tnj, • • ■ ,w]^y, encodes in to a codeword in the linear code and maps 
it to a point t G M” using Construction A as follows 


t= [BcP ^g{Gw)~\ mod Ac- 


(16) 


From the discussion in Section III-A we know that BcP ^g(Gw) G A, and hence, t G A/Ac. Finally, the 
transmit codeword x is generated by dithering t. 


X = [t — d] mod Ac = [BcP ^g (Gw) — d] mod Ac. (17) 

This sequence of operations is illustrated in Fig. Note that since rcov(Ac) = y/n, each codeword x 
satisfies ||x|| < rcov(Ac) = y/n, and hence, the power constraint 



n 


It is straightforward to show that the dithering operation ( [T7| ) is a one-to-one correspondence between 
t G A/Ac and x G (A — <i)/Ac. Further, from Lemma we know that ( fT^ is a bijection between the 
message space F^^ and the undithered codewords A/Ac if G is full rank. Hence, to ensure that no two 
messages are mapped to the same codeword, we only require that the random matrix G be full rank. It 
can be shown that (see p3| ) 

P (rank(G) < Ki) < 


We will only require a relaxation based on the above inequality. From ( [T5] ), we have Kl < '^/ 2 . Similarly, 
since p is a prime integer, we have p >2, and hence. 


P (rank(G) < Ki) < 


(18) 


C. Decoding 

The receiver employs a two stage decoder: in the first stage the receiver identifies the subcode of 
(A — d)/Ac corresponding to the available side information, and in the second stage it decodes the 
channel output to a point in this subcode. 
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1) Using Side Information to Expurgate Codewords: The side information at over a bloek of 

realizations of the K messages is of the form 


K 




(19) 


k=l 


The reeeiver desires to identify the set of all possible values of the message veetor w = (w{, • • ■ 
that satisfy ( [T9l ). Using the notation u = (m{, ■ ■ ■ ^ the side information ( [T9| ) ean be rewritten 

eompaetly in terms of w and u as 

(S' ®I()w = u, (20) 

where 0 denotes the Kroneeker produet of matriees and is the i x i identity matrix over Fp. Observe 
that ( [ 20 | ) is an under-determined system of linear equations, and the set of solutions is a eoset of the null 
spaee of S ® Ii. Let As G ^ rank {K — M)i matrix sueh that (S (8) Ie)As = 0, i.e., the 

eolumns of As form a basis of the null spaee of S 0 Then the set of all solutions to p0| ) is 

v + {Asw\w , (21) 

where v is the eoset leader. From ( [T^ , we eonelude that the undithered eodeword must be of the form 

t = [BcP~^g {Gv + GAsw)\ mod Ac for some w G (22) 

We will now use the property of g{-) that for any a ,6 G F”, g{a + b) = g{a) + g{b) mod p. Therefore, 
g{Gv + GAsw) = g{Gv) + g{GAsw) + pc for some c G Using this in ([2^, we obtain 


(23) 


t = [BcP ^g {Gv) + BcP ^g (GAsw) + Bcc] mod Ac 
= [BcP~^g (Gv) + [BcP~^g (GAgiu)] mod Ac + [BcC] mod Ac] mod Ac 
= [BcP~^g (Gv) + [BcP~^g (GAsw)] mod Ac] mod Ac, 


where we have used (|^ and the faet that BcC G Ac. Sinee the reeeiver knows v, the eomponent of t 
unavailable from the side information is 


i= [BcP ^g{GAsw)] mod Ac 


(24) 


Let C Fp be the subeode of 'if with generator matrix GAs, and As be the lattiee obtained by applying 
Construetion A to ^s and transforming it by BcP~^, i.e.. 

As = B^p-^gif^s) + Ac. 

Using GAs instead of G in Lemma we see that i G As/Ac and that ( [24| ) is a one-to-one eorrespondenee 
between w G and i G As/Ac as long as GAs is full rank. Together with ( [T7| ), p3] ), and ( [24| ), 

we eonelude that the transmit veetor x belongs to the following lattiee subeode of (A — d)/A^, 

(As +BcP~^g (Gv) -d) /Ac = { [i + BcP~^g(Gv) -d] mod Ac | t G As/Ac} . (25) 

The deeoding problem at the seeond stage is to estimate i, or equivalently w, from the ehannel output. 
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a 



t 


Fig. 5. The decoding operation at the receiver (S, a^). The vector v and the lattice Ag are determined using the side information available 
at the receiver. 


2) MMSE Scaling and Lattice Decoding: Let the channel output at the receiver (S', cr^) he y = x + n, 
where n is a Gaussian vector with zero mean and variance cr^ per dimension. The received vector is scaled 
by the coefficient a, resulting in 

ay = ax + an = X + an — {1 — a)x. (26) 


This MMSE pre-processing improves the effective signal-to-noise ratio of the system beyond the channel 
signal-to-noise ratio and allows the lattice decoder to perform close to capacity [[^, [46|. Let 

z = an — {1 — a)x 


be the effective noise term in ( |261 ). Using the facts that x and n are independent, ||x|| < y/n, and n has 
zero mean, we have 

E ll^ll^ = (1 — IE ||a:||^ + a^E ||n||^ — 2a(l — a) Ex'^n < (1 — a)^ n + a^a'^ n, 

where E is the expectation operator. The choice of a = minimizes this upper bound and yields 


Eiizir < 


na 


1 + 0-2 
2 


2 ’ 


1 + cr 

which is less than the Gaussian noise power E||n|p = na^. In the rest of the paper we will assume that 


a = 


1 + 0-2 


and use the notation 


^2 _ ^ 

^ 1 + ■ 


The lower bound ( fTO] ) on signal-to-noise ratio can be rewritten in terms of al as 

^2 ^-2{R+e){K-M) _ 


(27) 


(28) 


From ( [25] ), we know that x = i + B^p ^g{Gv) — d + Xc for some Ac G Ac. After MMSE scaling, the 
decoder removes the contributions of the dither d and the offset BcP~^g{Gv) from ay to obtain 

y' = ay- B^p~^g{Gv) + d 
= t -\- Ac ~\~ z. 

The decoder proceeds by quantizing y' to the lattice A5 and reducing the result modulo Ac. If the noise 
z is sufficiently ‘small’, then this sequence of operations will yield 


[Qas(!/')] inod Ac = [QAs(t + Xc+ z)] mod Ac = [i + Ac] mod Ac = i. 


(29) 


Given t, the receiver uses ( |2^ to obtain the undithered codeword t, and hence the message vector 
(ml,... ,w]^y, as follows 


t = [BcP ^g(Gv) -|-f] mod Ac. 
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To conclude, the deeoder obtains the estimate i of the undithered eodeword t from the reeeived veetor y 
as 


t = 


mod Af. 


[Qas(?/ 0] modAc + BcP ^g{Gv) 

= [Qhsiv') + BcP~^g{Gv)] mod Ac 
= [Qas {(^y - BcP~^g{Gv) +d) + BcP~^g{Gv) ] mod Ac, 


whieh shows that the mod Ac operation arising from ( |29l ) ean be ignored. The steps involved in the 
deeoding operation are illustrated in Fig. 

Note that the effeetive information veetor w is not eneoded in the point i G As, but is eneoded in the 
eoset f+Ac. The error event for this deeoder is (i/) ^i+ Ac, i.e., Qas( i+^c+z) ^i+ Ac, whieh is 
equivalent to QAg{z) ^ Ac. Henee, a deeoding error oeeurs if and only if 2 is eloser to a point in A5\Ac 
than any veetor in the eoarse lattiee Ac, i.e., if and only if 


E: Qas{z) G A5\Ac. 


IV. Proof of Main Theorem 


(30) 


In this seetion we first state and prove two teehnieal lemmas (Seetion IV-A), use these lemmas to show 
that the error probability at a given fixed reeeiver (S, a^) is small (Seetion IV-B), and then eomplete the 


proof of the main theorem by showing that the error probability at every reeeiver of the multieast ehannel 
is simultaneously small (Seetion [IV-C| ). Finally, we state some important eorollaries of the main theorem 
(Seetion |IV-D[ ). 


A. Technical Lemmas 

The first result, whieh is a direet generalization of [|^ Lemma 1] and [|T^ Lemma 2.3], gives an upper 
bound on the number of lattiee points lying inside a ball. 

Lemma 3. For any s G M", r > 0 and any n-dimensional lattice Ac, 

I Ac n B{s, r) I < ^ (rcov(Ac) + r)” , 

where Vn is the volume of a unit ball in 

Proof: Let TZ = (Ac fl B{s,r)) + V(Ac) be the set of all points in M" that are mapped to one of 
the points in Ac fl B{s,r) by the lattiee quantizer Qa^- Sinee 7 ?. is a union of the pairwise disjoint sets 
A + V(Ac), A G Ac n i3(s, r), and sinee eaeh of these sets has volume Vol(Ac), we have 

Vol(7^) = Vol(Ac) |Acni3(s,r)|. (31) 

Using the faet that V(Ac) C i 3 ( 0 , rcov(Ac)), we have the following 

7 ?. = (Ac n B{s, r)) + V(Ac) C B{s, r) + V(Ac) C B{s, r) + S( 0 , rcov(Ac)) C B{s, r + rcov(Ac)), 


where the last step follows from the triangle inequality. Consequently, we have an upper bound on the 
volume of TZ, Vol(7?.) < Vol {B{s,r + rcov(Ac))) = I4(r + rcov(Ac))". Using this result with ( [3T] ) proves 
the lemma. ■ 

As in Q, p0| , [^, we will rely on the faet that, with very high probability, the norm of the noise z 
is not mueh larger than ^/naf The probability that the effeetive noise z is ‘large’ is exponentially small 
in n. The proof of this result is given below. 

Lemma 4. Let x be uniformly distributed in V(Ac) and 5 > D be any positive number. Then 


> nal{l + 5)) < exp 


n(5-ln(l + 5)) 


+ exp 




(32) 
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Proof: We will prove ( |32l ) for every fixed realization of x in V(Ac), which shows that the statement 
of the lemma is true for any distribution of x on V(Ac). In the rest of the proof we will assume that 
X G V(Ac) is an arbitrary fixed vector and n is Gaussian distributed. Using ||a;|p < r'^^^{Ac) = n, we have 

||z||^ = \\an — (1 — (y)x\\^ = tt^||n||^ + (1 “ — 2 q!(1 — a)x'^n 

< a^llnlp + (1 — aYn — 2a(l — a)x'^n. 


Hence, we have the following upper bound 

P (||z|p > + 5)) < P (a^||n|p + (1 — a)^n — 2q;(1 — Q)x'^n > naj^l + 5)) . 

From the definition ( [27] ) of a^, we have nal{l + 5) = na'^a‘^{l + 5) +n(l — Q!)^(1 + 5). Hence, the above 
upper bound corresponds to the event 

a^||n||^ + (1 ~ aYn — 2a{l — a)x'^n > na‘^a‘^{l + 5) + n(l — a)^(l + 5). (33) 


The event (33) occurs only if at least one of the following two events occur 


Ea- > na‘^a‘^{l + 5), or 

Eb- (1 — aYn — 2a{l — a)x'^n > n(l — a)^(l + 5). 


(34) 


Therefore, P(||z|p > nal{l + 6)) < P{Ea^Eb) < P{Ea) +P{Eb)- We will now individually upper bound 
P{Ea) and P{Eb), and thereby complete the proof. 

A rearrangement of terms in ( |34| ) yields P{Ea) = P (|| 4 np > n(l + 5)). This is the probability that a 
Gaussian vector with unit variance per dimension lies outside the sphere of squared radius ?7,(1 + 5). The 
following is a well known upper bound on this probability (see [471) 

n{5 — ln(l + 6)) 


E‘{Ea) < exp 

The event Eb is equivalent to —2a(l — a)x'^n > n(l — a)^(5. Using a = we can show that this 

. Since x^n is a zero mean Gaussian random variable with variance a^lIxlP, we 


is same as x^n < 
have 


ndcj^ 


P{Eb) = P a;Tn < 


n8a^ 


= Q 


na5 


where Q(-) is the Gaussian tail function. Using ||a;|| < rcov(Ac) = ^/n and the Chemoff bound Q{y) < 
2 

exp(—^), we arrive at 


P(^^b) < exp 




This completes the proof. 


B. Error probability at a single receiver 

In this subsection we derive an upper bound on the decoding error probability Ps at a receiver (iS, a^) 
when averaged over the ensemble of lattice codes generated by choosing G uniformly over and d 

uniformly over V(Ac). 

The following result from [[^, known as the Crypto lemma, captures an important characteristic of 
random dithering. 

Lemma 5 ([[^). Let t G V(Ac) be any random vector. Ifd is independent oft and is uniformly distributed 
over V(Ac), then x = \t — d\ mod A^ is independent oft and uniformly distributed over V(Ac). 
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The property that the transmit vector x is statistically independent of t implies that the effective noise 
z = an — {1 — a)x is independent of the transmit message. This facilitates the error probability analysis 


through the observation that the error event ( [30| ) is statistically independent of t. 

For distinct messages w to be mapped to distinct points t, we require that GAs be full rank. Since As 
is full rank, this is same as requiring that G be full rank. Apart from the event 8 : Qks{z) G As\Ac, we 
assume that the decoder declares an error whenever the event 

g -. rank(G < Ki) 

occurs. Hence, the error probability P5 at the receiver {S,a'^) satisfies 

Ps = P(6 ^U^) <P(e;) + P(^). (35) 

From ( [T^ , we already know that P(^) is exponentially small in n. 

Using the given design tolerance e, we set 6 = 2'/^ — 1, which is positive if e > 0. Let = \/n(l + 
be the radius of the typical noise vector and = i3(0,r^). Then, 


P(^) = P(z G BJ eBJ + P(z ^ BJ P(^k ^ BJ 
<P(8jz e BrJ + P(z ^ BrJ. 


(36) 


Lemma 1^ provides an exponential upper bound on P{z ^ Br^)- In the following theorem we show that 
P{8\z G Br^) is also exponentially small in n. The proof of this result uses the technique of Q, [10| to 
bound the number of lattice points lying in an n-dimensional ball. 

Theorem 2 . For any receiver (5, a^) with Y2log (l + > (i? + e) {K — rank(5)), and for all large 

enough n, 

P{8\ze BrJ < 

when averaged over the ensemble of random lattice codes. 


Proof: From ( [30l ), we note that the decoder is in error when 2 is closer to some coset t'T Ac, with t' G 
As/Ac and t' f 0, than any point in Ac. The number of competing cosets is jA^/Ac \ {0}| = — 1, 

and we index them using the non-zero vectors vf G To each vf, we associate the coset 

corresponding to the coset leader 


t' = \BcP ^g{GAsw')\ mod Ac. 


(37) 


Since G is random, the coset leader F associated with a given w' is a random vector. Given that z G B^^ 
and 0 G Ac, the Euclidean distance between z and Ac is at the most 0|| < Vz. Hence, for an error event 
to occur, there must exist a coset t' + Ac at a distance less than from z, i.e., \{F + Ac) fl B{z, rz)\ 7^ 0. 
Indexing the cosets by w', we have 

/ \ 


P{8\z G < P 


IJ { |(f' + Ac) ni3(z,r^)| ^ 0 } 


y w'sf: 




Z eBr 




Z eBr 

z eBr 




(38) 


< P( |(f' + Ac)nS(z,r,)l ^0 

< E(|(t' + Ac)ni3(z,r,)l 

where the second inequality follows from union bound, and the last inequality follows from the observation 

1 { I (f' + Ac) n B[z, rz) \ 7^ 0 } < \ {F + Ac) n B{z, Cz) \, 
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where !{■} is the indieator funetion. Note that the expeetation operation in ([3^ is with respeet to the 
random veetor t' as well as the effeetive noise z. 

The matrix Ag has full eolumn rank, and henee, Asw' ^ 0 for every w' ^ 0. Using ( [37| ) and applying 
Lemma]^ we see that If is uniformly distributed in p“^Ac/Ac = p“^AcnV(A(.). Further, from Lemma|^the 
distribution of if is independent of z. Henee, the probability mass funetion of f equals | (p“^Ac) /Ac|“^ = 
p~'^ over every element of the set (p“^Ac) /Ac. Using this result, we obtain 

V{S\z e Br,) < E E F{t' = a)E {\{a + Ac) n B{z,r^)\ \ z e 

w' aGp~^Ac/Ac 

= E E ilia+ K)n Biz,r,)\\zeBr^) 

w' aSp“^Ac/Ac 

= E E(|(a + Ac)ni3(2,r,)| 

w' aep“iAc/Ac 

= p-"5^E(|p-iAcni3(z,r,)| \zeBr,), 


where the last equality follows from the faet that the set of eosets {a + Ac|a e p ^Ac/Ac} form a 
partition of p~^Ac. Sinee the number of eompeting w' is less than and |p“^Ac fl i3(z, rz)| = 

|Ac n;B(pz,prz)|, 

Pi£\z e BrJ < {\Acn Bipz,pr,)\ \z e Br^) . 

Using Lemma we bound the number of lattiee points inside the ball Bipz,prz), and obtain 

?(S\z eB„)< (rc„v(A,) + prj". 

Vol(Ac) 

Using the bounds pi^-^A < from ( [T3] ); Vol(Ac) > Vn iAA from ( fT4l ); Oz < 

from ( |2^ ; and the relations rcov(Ac) = \fn, Vz = y/n(l + 5)crJ, and 1 + 5 = 2'/^ we obtain 


Fi£\ze BrJ 


K. 


14n"/2 2-"V4 
^nRlK—M) /i / 


n + p^n(l + 6)a‘^ 


n 


V2 


+ ^/n{l + 6)a{j 


onR(K-M)o^^/4: / 

p 


2nR(K—M)2J''^/i 


n 


p2 


nl\l + 5)"/^ 


p\/(l + 5) 


z2^-{R+e){K-M) 


^ I 2 — — Al) 

\p 

2nR{K—M)2p'!/2 

+ 1 


< 


2n(i?+€)(A'-M) \^p2(^+")(^-^) 

‘2nR{K—M)c2^^/2 


2 nR{K-AI) 2 nt{K-A[) \ ^ 

Sinee JL — M > 1, we have > 2”% and henee, 

Fi£\z eBA< 2-”'/= ' ^ 


p2{R+e){K-M) 


+ 1 


+ 1 
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Using the inequality ( [T^ , whieh immediately follows from the choiee of the prime integer p, we have 

V{8\z e Br,) < 

Note that this upper bound holds for all suffieiently large values of n. ■ 

We will now combine the result of Theorem with ( [35] ) and ( [3^ , and upper bound the error probability 
P 5 at the receiver (S', as 

P 5 < P(^) + P(-2 iBr^)+V{£\ze BrJ. 

Using Theorem Lemma and ( [T 8 | ), we obtain 


P < 2-"/2 + 


exp 


n{5- ln(l + 5)) 


1 + exp I- — 1+2 ' , 


(39) 


for sufficiently large n. Let cXmin > 0 be the least noise standard deviation a among the finitely many 
receivers in the multicast channel. Then we have Also, 5 — ln(l + 5) > 0 as long as 

S = — 1 is positive. Consequently, the parameter 


e = min loge 


5 - ln(l + 5) 


,loge 


a. 


2. 52 

in ^ 


e 

’4 


is positive, and the value of each of the terms on the right-hand side of ( |39| ) is at the most 2 . Hence 

the error probability at the receiver (S, a'^) can be upper bounded as 


Ps < 4 ■ 2- 


(40) 


for all sufficiently large n. We remark that the minimum required value of n for this upper bound to hold 
is a function of only e, and is independent of the side information matrix S. 


C. Completing the proof of the main theorem 

The bound ( |40l ) shows that the error probability for a fixed side information matrix S, averaged over 
the random code ensemble, tends to 0 as the code dimension increases. Hence, there exists a choice of 
lattice code (which is chosen for the given side information matrix S) with a small error probability at this 
receiver. We want to prove a slightly stronger result, viz., there exists a lattice code such that the decoding 
error probability for every possible side information matrix S is small as long as the receiver SNR is large 
enough. In order to prove this result, we consider a hypothetical multicast network that consists of one 
receiver for each possible choice of the matrix S. Note that two distinct values of the matrix S that have 
identical row space constitute equivalent receiver side information configurations. Hence, it is enough 
to consider a multicast channel that consists of one receiver corresponding to each possible subspace of 
F^, where the dimension of the subspace can be between 0 and K — 1. A subspace of dimension M, 
D < M < K — 1, can be mapped to a M x K matrix whose rows form a basis of the subspace. This 
map embeds the set S of all non-equivalent choices of side information matrix S into which 

is the set of all matrices over Fp with K columns and at the most K — 1 rows. Hence, the number of 
receivers |5| can be upper bounded as 

|5| < - ^p^"- 

M=0 M=0 

We assume that each receiver (5, cr2), S e S, satisfies the lower bound ( [T0| ) on SNR and outputs an 
estimated message vector w{S) using its own channel observation. We say that the multicast network is 
in error if any of the receivers commits a decoding error. Using a union bound argument and the upper 
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bounds ( |40l ) and ( |4T] ), we see that the network error probability P 
of lattice codes satisfies 


net 


averaged over the random ensemble 


Pnet = P (network error) 



< P ({u;(5) ^ w}) = ^ P5 < 2—, 

se-s se-s 

(42) 


which tends to 0 as n becomes arbitrarily large. Hence, for every sufficiently large n, there exists a lattice 
code such that the network error probability is as small as desired. In particular, this implies that there 
exists a choice of lattice code such that the decoding error probability at every receiver S E S, 

is simultaneously small. This completes the proof of the main theorem. 


D. Corollaries 

1) Almost all lattice codes are good: Using standard arguments based on Markov inequality 0, Q, 
[ |46| , we show that almost all codes from the random lattice code ensemble yield a small error probability. 
In order to prove this, it is sufficient to show that for almost all lattice codes the network error probability 
is small over the hypothetical multicast channel that consists of one receiver for each possible side 
information matrix. 

For a given dimension n, all the lattice codes in the random code ensemble use the same coarse lattice 
Ac, but differ in the choice of the fine lattice A and/or the dither vector d. Let 


X{A,d) = P(network error |A,d) 


denote the network error probability for a given choice of A, d in the hypothetical multicast channel. If A 
and d are chosen randomly, then X is a random variable. From ( |42l ), we know that the expected value of 
X, which is equal to the average network error rate Pnet, is small. Suppose we want a lower bound on 
the fraction of random codes with error probability at the most Using Markov inequality, we have 


\ / — o_ne:/2 — 


2—ne/2 




It follows that, asymptotically in n, for almost all choices of the fine lattice A and dither vector d, the 
resulting lattice code (A—<i)/Ac provides an exponentially small error probability in the multicast channel, 
i.e.. 


lim P (X < 2-""/") = 1. 


2 ) Goodness in single-user AWGN channel: Our model of multicast channel includes as a special case 
the single-transmitter single-receiver AWGN channel with no side information at the receiver, i.e., number 
of messages K = 1, side information matrix S is the empty matrix and rank(5') = M = 0. The decoder 
for this receiver uses the ^ identity matrix for A5 and the all zero vector for v, see ( [2T] ). Specializing 
the main theorem for a single receiver with M = 0, we immediately deduce that the ensemble of random 
lattice codes achieves the capacity of the single-user AWGN channel and hence arrive at Corollary 
It is well known that (nested) lattice codes, and lattice constellations in general, can achieve the capacity 
of the point-to-point AWGN channel [|7|-pT[. Our corollary to the main theorem provides an alternate 
proof of this result which is based only on simple counting arguments. 

The proof technique presented in this paper relies on lattices obtained by applying Construction A 
to random linear codes over a large enough prime field Fp. This technique was introduced by Loeliger 
in [401 and used in @-[101 to prove the goodness of lattice codes in AWGN channel. Each of these 
results requires a different choice of the prime p and places different requirements on the characteristics 
of the coarse lattice Ac. The following are some of the properties that have been used in the literature: 

• Rogers-good: the ratio of covering radius rcov(Ac) to the effective radius refr(Ac) of the lattice must 
be close to 1, see @. Such a lattice is also said to be good for covering. 
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MSE-good: the value of the lattiee parameter 


nVol(Ac)i+"/'* JV(Ac) 


fv(A ) IklPdx, known as the normalized 


second moment, is elose to V27re, see [451. Every Rogers-good lattiee is also MSE-good, and henee, 
this is a weaker requirement. 

• Poltyrev-good: sueh a lattiee, when used as an infinite eonstellation, aehieves the eapaeity of an 
AWGN ehannel in whieh the transmitter has no power eonstraints [451, \^\. These lattiees are 
resilient against additive Gaussian noise. 

The aehievability result of Q requires Ac to be simultaneously Rogers-good and Poltyrev-good, and uses 
p = 2”^, i.e., the prime field Fp used for Construetion A varies with the dimension of the lattiee eode and 
the size of the field inereases exponentially in n. The random eode ensemble of Q uses an MMSE-good 
lattiee for Ac, lets p grow as n^'^, and ean aeeommodate a wide elass of ehannel noise statisties, ineluding 
white Gaussian noise. The eode eonstruetion of [10| requires p to be at least needs no dithering 
operation, i.e., uses d = 0, but is known to aehieve eapaeity only if SNR > 1. In eomparison, our proof 
method uses a fixed (albeit large) value of p and holds for any SNR > 0, while requiring that Ac be 
Rogers-good. 


V. Conclusion 

We have showed that lattiee eodes are optimal for eommon message broadeast in Gaussian ehannels 
where reeeivers have side information in the form of linear eombinations of souree messages. We used 
random lattiee ensembles obtained by applying Construetion A to linear eodes over appropriately large 
prime fields Fp. The lower bound p > max {T!'^ — on the value of p does not neeessarily 

pose a limitation in eommunieation applieations. For instanee, in the relay network of Example the 
first phase of the protoeol, namely eompute-and-forward ffTsl, only requires that —)■ 0 as n —)■ oo. 


whieh ean be met by our seheme by varying p with the dimension n: for instanee, by ehoosing p to be 
the smallest prime greater than or equal to for a fixed (3 > 1. This will also ensure that the inequaliW 
p > max {2^^^, (2'/'‘ — 1)“^2“^} holds for all suffieiently large values of n. Similarly, with Example 0 
where the broadeast signal supplements a wired multieast network, it is known that wireline network eodes 


meeting the max-flow bound exist over every large enough finite field [26|. Henee, we ean ehoose p to 


be suffieiently large to simultaneously optimize both the wired and wireless parts of the hybrid network. 
On the other hand, designing lattiee strategies for a fixed small size of the finite field, espeeially sizes 
that are powers of two, may have greater praetieal signifieanee. 

The eapaeity of the Gaussian broadeast ehannel with reeeiver side information under general message 
demands, sueh as with private message requests, is known only for some speeial eases [[29|, [[48|, [|49|. 


The proofs for aehievability in these eases utilize ensembles of eodebooks generated using the Gaussian 
distribution together with dirty-paper and superposition eoding. It will be interesting to examine if the 
lattiee strueture of the eodes proposed in this paper ean be exploited to derive new eapaeity results beyond 
the known eases. 


Appendix I 

Capacity of the Common Message Gaussian Broadcast Channel with Coded Side 

Information 

Consider the problem setup with a single transmitter and N reeeivers (5i, af),..., (Sat, as deseribed 
in Seetion[ri-A[ We now provide a sketeh of the proof that C, defined in Q, is the eapaeity of this ehannel. 


A. Converse 

Suppose there exists a eoding seheme that aehieves rate R in the multieast ehannel with vanishing 
deeoding error probability at all the reeeivers. Eet the seheme transmit one realization of (wi,... ,wk) 
for every n ehannel uses, i.e., R = log2p. From ([T]) the eonditional entropy of eaeh realization of 
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(wi,..., wk) at the i* receiver {Si, af), given the corresponding coded side information, is {K—Mi) loga p. 
The per-channel use conditional entropy of the message is thus 


{K - Mi)\og2P 

K 


{K - Mi) R. 


In order to guarantee reliable communication it is necessary that the mutual information between the 
channel input at the transmitter and the channel output at the receiver be greater than the conditional 
entropy {K — Mi) R. Since the input power is constrained to be at the most 1 and the noise variance at 
the receiver is af, the maximum mutual information is V 2 log 2 (1 + V®"?)’ hence we have 


{K - M,) R < -log. 



or equivalently R < - -r ■ - log, 

^ ^ {K-Mi) 2 


Considering all the N receivers we immediately deduce that R < C. 



B. Achievability 

The proof of achievability closely follows the proof of Theorem 6 of CD and the standard textbook 
argument used for the achievability of the capacity of single-user AWGN channel. Let e > 0 be any 
constant. For a given code length n choose the message length i as the largest integer such that the rate 
R = ^In log 2 p satisfies 


1 1, f l-e\ 

{K - Mi) 2 af J 

As n —cx), it is straightforward to show that R converges to the right-hand side of the above inequality. 
For each of the 2"^^^ message vectors w = (iu|,.. . ,w]^)^ e associate a codeword x{w) e M" each 
of whose components are generated independently using the Gaussian distribution with zero mean and 
variance (1 — e). These 2"^^ vectors constitute the randomly-generated n-dimensional codebook X. 
Encoding: If the source message is w, the transmitter broadcasts the vector x{w) over n channel uses. 
Decoding: Consider the receiver {Si, af) that observes the channel output yi and the side information 
Yl!k=i m = 1,..., Mi, where Si = As in Q, the receiver determines the subcode 

'^sub = ^{Si,Ui\ - ■ ■ ,UMi) of codebook A that corresponds to the set of all message vectors w 
which are consistent with the observed coded side information. Among the codewords in A’sub, the decoder 
chooses the vector that is jointly (weakly) e-typical with yi. If there exists a unique such codeword x{w) 
that additionally satisfies the power constraint ||a;(u;)|p < n, the receiver declares w as the decoded 
message. Otherwise the receiver declares a decoding error. 

Given that T^ub consists of vectors generated independently using the Gaussian distribution 

with zero mean and variance (1 — e) and 

{K - Mi) R<^ log, -f - 3e, 


it is routine to show that the probability of decoding error at the receiver, averaged over the ensemble 
of codebooks, decays exponentially with code length n [50, proof of Theorem 10.1.1]. It follows that 
the probability that any of the N receivers commits a decoding error is also exponentially small in n. 
Hence, there exists at least one codebook X that transmits each message at rate R with the decoding 
error probability at all the receivers as small as desired. Letting n —>■ oo and e —0, we observe that any 
rate R < C is achievable. 
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